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OO ■ Abstract 

A state-dependent degraded broadcast diamond channel is studied where the source-to-relays cut is 
^1 , modeled with two noiseless, finite-capacity digital links with a degraded broadcasting structure, while the 

relays-to-destination cut is a general multiple access channel controlled by a random state. It is assumed 
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that the source has non-causal channel state information and the relays have no state information. Under 
this model, first, the capacity is characterized for the case where the destination has state information, 
i.e., has access to the state sequence. It is demonstrated that in this case, a joint message and state 
transmission scheme via binning is optimal. Next, the case where the destination does not have state 
information, i.e., the case with state information at the source only, is considered. For this scenario, lower 
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and upper bounds on the capacity are derived for the general discrete memoryless model. Achievable 



rates are then computed for the case in which the relays-to-destination cut is affected by an additive 
Gaussian state. Numerical results are provided that illuminate the performance advantages that can be 
accrued by leveraging non-causal state information at the source. 
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I. Introduction 

We consider a communication channel in which the source wishes to communicate to the 
destination via the help of two parallel relays and there is no direct link between the source 
and the destination, as shown in Fig. [TJ The first hop, from the source to the relays, consists 
of two noiseless digital links of finite capacity: a common link of capacity C\ (bits per channel 
use) from the source to both relays and a private link of capacity C2 (bits per channel use) 
from the source to relay 2. The first hop has thus a degraded broadcast channel (BC) structure. 
The second hop, from the relays to the destination, is a general multiple access channel (MAC) 
controlled by a random state [1]. It is assumed that (?) the entire state sequence that affects the 
MAC is known to the source before transmission, (ii) the state is not available at the relays, 
and (Hi) it may or may not be known at the destination. We term this channel model as the 
state-dependent degraded broadcast diamond channel (SD-DBDC) with non-causal channel state 
information (CSI) at the transmitter (i.e., CSIT) and with or without CSI at the receiver (CSIR). 

The motivation to study this channel stems from the downlink of a distributed antenna system, 
in which a central unit controls two antennas, e.g., two pico-base stations, via backhaul links, 
for communication to an active user over a wireless channel, see for example [0. The backhaul 
communication may be received by both antennas over a wireless broadcast channel modeled 
by C\, or received by one of antennas via a dedicated optical fiber cable modeled by C2. In 
such a system, the state may model the fading coefficients for the MAC between the distributed 
antennas and the user, or an interference signal affecting this MAC. In the first case, the user can 
typically measure the fading channels of the MAC, thus obtaining CSIR, while the central unit 
may be informed about such fading channels, e.g., via dedicated feedback links, thus obtaining 
CSIT. The pico-base stations, serving as the relays, are not expected to decode the feedback 
signal from the user, due to a design choice or insufficient signal-to-noise ratio, and thus CSI is 
assumed to be unavailable at the relays. In the latter case of an interfering signal affecting the 
MAC, the interference signal may be communicated to the central unit via backhaul links from 
the interfering transmitters, e.g., another central unit, thus obtaining CSIT, while relays and the 
user are not informed, thus having no CSIR. 



A. Background and Related Work 

The diamond channel, in which a source communicates to two relays via a general broadcast 
channel and the relays are connected to the destination via a general state-independent MAC, 
was introduced by Schein and Gallager in [0 and has been widely studied ever since. For the 
discrete memoryless (DM) diamond channel, several achievability results were established in 
0, while for the Gaussian case, it was shown by fl4] that partial-decode-and-forward relaying 
achieves a rate within one bit of the cut-set bound. Despite all the activity, the capacity of this 
channel in general is open except for some particular instances j5]|-[|7]|. 

A relevant special case of the diamond channel is obtained when the BC in the first hop is 
modeled as two orthogonal, noiseless digital links of finite capacity. We refer to this model as 
orthogonal broadcast diamond channel (OBDC). The OBDC was first studied by Traskov and 
Kramer in [51, where upper and lower bounds on the capacity of the DM OBDC were derived. 
Recently, Kang and Liu [9] proposed a single-letter upper bound for the OBDC with a Gaussian 
MAC and established the capacity for a special subclass of Gaussian OBDCs. The SD-DBDC 
studied here is related to the OBDC, with the differences that the first hop is modeled as a 
degraded noiseless broadcast channel and that the MAC in the second hop is state-dependent. 

A comprehensive review of previous work on channels with states can be found in ifTOll . while 
the discussion here focuses only on work directly related to the present contribution. Consider 
first a system as in Fig. [T] but with a single relay and with the relay having full knowledge of 
the message intended for the destination. Note that in this case, the source-to-relay link, unlike 
the SD-DBDC, only carries state information and not the message. This channel, which can be 
seen as a point-to-point system with coded CSIT, was studied by Heegard and El Gamal in ifTTI 
under the assumption of CSIR. Therein, a general lower bound was derived and shown to be tight 
for some special cases. In 021, Cemal and Steinberg studied the extension of this single-relay 
setting to the case with two relays, under the assumption that the relays are informed about the 
two independent messages to be delivered to the destination and that there is full CSIR. This 
model can be seen as a MAC with coded CSIT. Assuming that the source-to-relays links are 
modeled as in Fig. \T\ with degraded noiseless channels, the capacity region for this model was 
characterized. Additionally, inner and outer bounds on the capacity region were derived for the 
case where the source-to-relays cut consists of separate noiseless links. A related work is also 



that of Permuter et al. [Q~3), which derived the capacity region for a MAC where the encoders, 
i.e., the relays of Fig. [Q are connected by finite-capacity links to one another, and the MAC 
channel depends on two correlated state sequences, each known to only one encoder, and there 
is full CSIR. 

We now focus on related studies that assume no CSIR. For the set-up with a single relay 
and where the relay is informed about the message, i.e., the coded CSIT problem, an upper 
bound on the capacity was found in reference [Q3) and proved to be achievable in some special 
cases. It is noted that, if the relay was informed about both state and message, the optimal 
strategy would be Gel'fand-Pinsker (GP) encoding JT], which reduces to Dirty Paper Coding 
(DPC) lfT5l in the corresponding Gaussian model with an additive state. The state-dependent 
MAC with various form of CSIT and no CSIR has been studied in |[T6l - ll2T1l . Assuming non- 
causal CSIT, the capacity regions for such MAC models are still unknown except the following 
special instances: the MAC with one informed encoder and degraded messages lfT8l : the binary 
MAC with two additive state sequences, each known to one encoder [1201 ; and the Gaussian 
MAC with a common state known to both encoders |[T6l [|22l Chapter 7]. Relay channels with 
state have also been investigated with various type of state information at the nodes, see for 
example, Il23l - ll25l . In particular, in reference lt24l . Zaidi et al. studied a single relay channel 
with non-causal CSI at the source and proposed various achievable schemes. Capacity results 
were also identified for some special cases [|24]. 

B. Contributions 

In this paper, we study the SD-DBDC model illustrated in Fig. [Q with non-causal CSIT and 
with or without CSIR. Our contributions are summarized as follows: 

. For the DM SD-DBDC with non-causal CSIT and CSIR, we find the capacity. The key 
ingredient of the achievability is a form of binning inspired by |fT3l . whereby the source 
selects directly the codewords to be transmitted by the relays in such a way as to adapt them 
to the given realization of the state sequence. It is demonstrated, similar to [fl"3l . that such 
a joint message and state transmission scheme from the source to the relays is optimal and 
that it generally outperforms a simple scheme whereby the source sends separate message 
and state descriptions to the relays, see Section HHl 



• For the DM SD-DBDC with non-causal CSIT and no CSIR, we first derive an upper bound 
on the capacity and then propose two transmission strategies. The first proposed strategy 
operates by sending separate message and state descriptions over the digital links to the 
relays so as to allow each relay to perform GP coding against the quantized state sequence 
it reconstructs. We refer the scheme to as GP coding with quantized states (GP-QS) at 
the relays. The second scheme, inspired by [|24), [|2~6l . instead works by having the source 
first encode the message via GP coding as if the relays had perfect message and state 
information. Then it sends one common description of the resulting GP sequence to both 
relays and one refinement description to relay 2. We refer this scheme to as quantized GP 
coding (QGP). The corresponding lower bounds are derived and presented in Section HV-B I 
to liV-Ct 

• For the case with non-causal CSIT and no CSIR, we also study the Gaussian SD-DBDC 
with an additive Gaussian state. Achievable rates based on the proposed GP-QS and QGP 
schemes are evaluated. Numerical results illuminate the merits of non-causal CSIT at the 
source node and demonstrate the relative performance between the GP-QS and QGP schemes 
for the Gaussian SD-DBDC, see Section ITV-Dl 

Notation: We denote the probability distribution of a random variable X as pxix) = Pr[X = 
x], or as p(x) when the meaning is clear from the context. Notation x l represents vector 
[xi,...,Xi\. For an integer L, the notation [1 : L] denotes the set of integers {1,...,L}; for a 
positive real number /, the notation [1 : 2 l ] denotes the set of integers {1,..., [2']}, where [.] 
is the ceiling function. Af(0, a 2 ) denotes a zero-mean Gaussian distribution with variance a 2 . 
Finally, C(x) is defined as C(x) = | log 2 (l + x). 

II. System Model and Main Definitions 

In this section, we introduce the model studied in this work. Specifically, the SD-DBDC model, 
depicted in Fig. [Q is denoted by the tuple (Ci,C 2 , X\ x X 2 x S,p(y\xi,x 2 ,s),y), where Ci 
and C 2 are the capacities in bits per channel use of the common link from the source to both 
the relays, and the private link from the source to relay 2, respectively, X\ and X 2 are the two 
input alphabets, S is the state alphabet, y is the output alphabet and p(y\xi,x 2 ,s) represents 
the channel probability mass functions (PMFs) describing the MAC between the relays and the 
destination. The state sequence s n is generated in an independent and identically distributed 
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Fig. 1. A state-dependent degraded broadcast diamond channel (SD-DBDC) with non-causal channel state information (CSI) 
at the transmitter (CSIT) and with or without CSI at the receiver (CSIR). The CSIR switch is closed or open, respectively. 



(i.i.d.) fashion according to a fixed PMF p(s), i.e., 

p(s n ) = \lp(s l ). 



(1) 



i=l 



The channel is memoryless in the usual sense and the entire state sequence s n is assumed to be 
non-causally known to the source node, i.e., we assume non-causal CSIT. However, sequence 
s n may or may not be available at the decoder, i.e., we may or may not have CSIR. 

Let W be the message that the source wishes to send to the destination, which is uniformly 
distributed over the set W = [1 : 2 nR ]. We define the code as follows. 

Definition 1: A (2 nR ,n) code for the SD-DBDC consists of: 

1) An encoding function at the source node 

/:Wx5 n 4[l: 2 nCl ] x [1 : 2 nC % (2) 

which maps the message and the state sequence into two indices M\ and M 2 transmitted 
over the source-to-relay s links; 

2) Two encoding functions at the relays 



h:[l: 2 nCl ] -> X?, 
and h 2 : [1 : 2 nCl ] x [1 : 2 n ° 2 ] -» X?, 



(3) 
(4) 



that map the information received by each relay, namely Mi by relay 1 and (Mi, M 2 ) by 
relay 2, into the corresponding sequences transmitted by the two relays; 



3) A decoding function at the destination. For the case of no CSIR, we have 

9 ■ y n -+ W, (5) 

which maps the received sequence into a message estimate W £ W, while with CSIR, we 
have 

g : y- x S n -» W, (6) 

which maps the received sequence and the state sequence into a message estimate W £ VV. 

The average probability of error, P e , is defined as P e = PrfW ^ W]. A rate R is achievable 

if there exists a sequence of codes (2 nR , n) as defined above such that the probability of error 

Pe — > as n — > oo. The capacity C of this channel is the supremum of the set of all achievable 

rates g3. 

III. Non-causal CSIT and CSIR 

In this section, the capacity is established for the DM SD-DBDC with non-causal CSIT and 
CSIR. The capacity-achieving transmission scheme is presented in Section ITH-Al For comparison, 
a straightforward transmission strategy is also considered and its suboptimality is then shown in 
Section HlFBl 

A. Capacity Result 

The achievability is based on a scheme in which the source encoder directly selects the 
codewords to be transmitted by the relays so as to adapt them to the given realization of the 
state sequence. This is accomplished via a strategy, inspired by lfl3ll . in which the codebooks for 
the transmitted signals X™ and X%, are binned so that the bin index is identified by the message 
to be delivered to the destination, and the codewords within the bin are chosen to match the 
state sequence. Moreover, given the degraded broadcast channel between source and relays, the 
codebooks for X™ and X£ are superimposed, so that the codeword for X™ is known at both 
relays, while the codeword for X% is only transmitted, superimposed on X™, by relay 2. The 
following theorem presents the result. 



Theorem 1: For the DM SD-DBDC model with non-causal CSIT and CSIR, the capacity is 
given by 



^ C 1 + C 2 -I(X 1 ,X 2 ;S), 



C = maxmin 

v 



C X -I(X 1 ;S)+I(X 2 ;Y\X 1 ,S] 
yi(X l ,X 2 ;Y\S) 
with the maximum taken over the distributions in the set 



(7) 



subject to 



V = {p(s,x 1 ,x 2 ,y) :p(s)p(x 1 ,x 2 \s)p(y\x 1 ,x 2 ,s)} (8) 

C 1 >I(X 1 ;S), (9) 

fmdC 1 + C 2 >I(X 1 ,X 2 ;S). (10) 

Proof: We provide here a sketch of the proof of achievability. Details are provided in 
Appendix |A[ along with the proof of converse. Let e 2 > e\, and define functions 5(ei) and 8(e 2 ) 
such that 5(ei) — > as €\ — > and S(e 2 ) — > as e 2 — > 0. The source splits message w G [1 : 2 nR ] 
into two independent parts w\ G [1 : 2 n/?1 ] and u? 2 G [1 : 2 ni?2 ]. Message twi is associated with 
a bin Bx{w\), that contains 2 n ( /( ^ Xi; ^ +<5 ^ 1 ^ i.i.d. generated codewords indexed by x^(wi,li), 
with lx G [1 : 2 n ( / ( Xi;5 ) +<J ( ei ))], while message u> 2 is associated with a bin B 2 (w 2 \wi, h) 
for all pairs (wi,li), that contains 2 n ^( X2;5 l Xl ) +l5 ( e2 ^ i.i.d. generated codewords indexed by 
x 2 (w 2 ,l 2 \wi,h), where l 2 G [1 : 2' 1 ( / ( X2;S 'l Xl ) +<5 ^ 2 ^]. Given a message pair w = (w 1 ,ty 2 ) and 
a state realization s n , the source encoder first looks for an index l± G [1 : 2 n ( / ( Xi; ' s ^ +<5 ^ 1 ^] such 
that codeword £™ (u>i, /i) G £>i(u>i) is jointly typical with s n ; it then looks for an index l 2 G [1 : 
2 n(/(x 2 ;5|x 1 )+5(e 2 ))j such that coc ieword X2(w 2 ,l 2 \w 1 ,l 1 ) G i3 2 (w 2 |wi,/i) is jointly typical with 
(xi(wi, li), s n ). Thus, index mi = (iui, /i), is conveyed to both relays and index m 2 = (w 2 , l 2 ), 
is only conveyed to relay 2 over the digital links. Upon receiving the index and retrieving its 
corresponding components, relay 1 forwards x"(u>i, h) and relay 2 forwards x 2 (w 2 , l 2 \wi, /i) to 
the destination. Observing the output sequence ?/ n and the state sequence s n , the decoder chooses 
a unique tuple of (u)i, w 2 ,lx,l 2 ) such that (x" (t&i, fi), x 2 (w 2 , l 2 W\, /i ), s n , y n ) are jointly typical. 
In this way, the final message estimate w is uniquely determined by w\ and u) 2 . ■ 



B. The Suboptimality of Separate Message-State Transmission 

In the capacity-achieving scheme discussed above, the source encoder selects the codewords 
for the relay directly based on both message and state sequence in a joint fashion. One can 
consider, for comparison purposes, a scheme in which the source encoder sends message and 
state information to the relays separately. The suboptimality of such an approach for a related 
model was discussed in [fT3l . We emphasize, however, that, while related, the model considered 
here is not subsumed by, nor does it subsume, the model in [[T3l . 

To elaborate, assume that the source splits the message as w = (wi,w 2 ), as done above, 
and describes the state sequence using a successive refinement code (Si,S 2 ) ll28l . where Si 
represents the base state description and S2 represents the refined description. Message wi and 
state description Si are sent to both relays, while message w 2 and state description S 2 are sent 
only to relay 2. A coding scheme, similar to that of Theorem 1 of lfT2l . can now be devised 
in which message w\ is transmitted by using a codebook, conditioned on the description Si, 
while message w 2 is encoded by relay 2, superimposed on the codeword encoding w\ and is 
conditioned on state descriptions (Si, S 2 ). The corresponding achievable rate is characterized as 



-^separate = HiaX mm 
r -pi 



/ Ci + C 2 -I(S u S 2 ;S), \ 

C\ — I (Si; S) + I(X 2 ; Y \Xi, S, Si, S 2 ), 
1 I(Xi,X 2 ;Y \S, Si, S 2 ) , 



(11) 



with the maximum taken over the distributions in the set 

V = {p(s,si,s 2 ,Xi,x 2 ,y) :p(s)p(si,s 2 \s)p(xi \si)p(x 2 \xi,si,s 2 )p(y\xi,x 2 ,s)} (12) 

subject to 

Ci>J(Si;S), (13) 

and Ci + C 2 > /(Si, S 2 ;S), (14) 

where the alphabet size of Si is bounded as |«Si| < |S| + 3 and the alphabet size of S 2 is 
bounded as \S 2 \ < \S\ (\S\ + 3) + 2, by standard cardinality bounding techniques [[221 Appendix 
C]. Note that the constraints ([T3l and ([T4l represent the well-known conditions that allow the 
construction of a successive refinement code with test channel p(si, s 2 \s) ll28l . 

We now show that we have in general -R scpar atc < C and that this inequality can be strict. 
In particular, for a fixed p(s) and channel PMF p(y \xi,x 2 ,s), considering any PMF in the set 



10 

V of <[I2>, we have the following Markov chains: S - S x - X l3 S - (S x , S 2 ) - (X 1 ,X 2 ) and 
(Si, S 2 ) — (S,Xi,X 2 ) — Y. Based on these chains, we have the following inequalities 

Ci>I(Si;S)>I(Xi,S), (15) 

Ci + C 2 > I(S U S 2 ; S)>I (Xi, X 2 , S) , (16) 

I(X 2 , Y \Xi, S, Si, S 2 ) 

= H(Y\Xi,S,Si,S 2 )-H(Y\Xi,X 2 ,S,Si,S 2 ) 

= H(Y\Xi,S,Si,S 2 )-H(Y\Xi,X 2 ,S) 

<I(X 2 ;Y\Xi,S), (17) 

&nAI(Xi,X 2 ;Y\S,Si,S 2 ) 

= H(Y\S,Si,S 2 )-H(Y\Xi,X 2 ,S,Si,S 2 ) 

= H(Y \S, Si, S 2 ) - H(Y \Xi,X 2 , S) 

<I(Xi,X 2 ;Y\S), (18) 

which imply that -R S c P aratc < C . We now show with an example that this inequality can be strict. 
For the example, we consider the special case of our model obtained with C\ — and X x 
taken as a constant, so that the model reduces to the two-hop line network, consisting of the 
source, relay 2 and the destination (studied also in [fT3l , see Fig. 2 of |[T3l if R 2 = and 
p(y\xi,x 2 ,s) = p(y\x 2 ,s)). Inspired by the binary example considered in [fT3l in a slightly 
different context, we then concentrate on the binary model described by 

Y = SX 2 © Z, (19) 

where the state S ~ Bernoulli(^), the noise Z ~ Bernoulli (p 2 ) with p z = Pr[Z — 1] 6 [0, |], 
independent of 5", and © denotes the modulo-sum operation. We further impose a cost constraint 

n 

on the binary input X 2 at relay 2 as i Yl E [X 2j i] < p X2 with p X2 G [0, |], where E[.] denotes 
the expectation operation. The capacity of this binary example can be derived from Theorem [[] 
along with the additional input constraint and is given by 

C 2 - H b (\(p Q +pi)) + \H b (p,) + \H b (pi), 
C = max mm ( | , (20) 

\H b (pi *p z ) - \H b (p z ) 
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lgo for C2 = 0.5, and p X2 =0.1 or 0.3 in the binary 



subject to constraints H b (\(p G + pi)) - \H b (p Q ) - \H b (pi) < C 2 and \{p Q +p t ) < p X2 , where 
po = Pr[X 2 = 1 \S = 0] e [0, 1], px = Pt[X 2 = l\3 = l]e [0, 1], H b (p) = -phg 2 (p) - (1 - 
p) log 2 (l — p), and "*" denotes the convolution operation, e.g., p\ *p z = p\(l —p z ) + (1 — p\)p z - 
Similarly, rate -R sep arate can be obtained from (fTTT ). We also consider a special case of the 
"separate" scheme, in which only message information is sent to the relays, so that we set Si, 
S 2 to a constant in (fTTT) (rate -R pU rc- message in the figure). 

Numerical results are provided in Fig. |2] where C, -R scpa rate and -R pur e-messagc are plotted 
versus p z for C 2 = 0.5, p X2 = 0.1 or 0.3, and the cardinality of S 2 is assumed to be m = 2 in 
-^separate (increasing m to 3, 4 or 5 did not boost the numerical rates of -R scpa ratc)- It is clearly 
seen that C strictly improves upon -R scp aratc and the latter strictly outperforms -R pure - message for 
a wide range of p z values in this example. 



12 

IV. Non-causal CSIT and No CSIR 

In this section, we turn to the SD-DBDC with non-causal CSIT and without CSIR. In the 
absence of CSIR, the capacity is difficult to establish. In the following, we thus first present 
an upper bound on the capacity and then illustrate two achievable schemes for the DM model 
in Section IIV-AI to IIV-CI Results are then extended to a Gaussian SD-DBDC with an additive 
Gaussian state in Section HV-Dl 

A. An Upper Bound 

Proposition 1: For the DM SD-DBDC model with non-causal CSIT and no CSIR, the capacity 
is upper bounded by 



/ r c f 1 + <7 2 -J(X 1 ,X a ;S'), ^ 



R upp = maxmin 

"Pupp 



(21) 



C 1 -I(X 1 ;S) + I(X 2 ;Y\X 1 ,S), 
yi(U;Y)-I(U;S) J 

with the maximization taken over the distributions in the set 

^u PP = {p{s,u,x 1 ,x 2 ,y) :p(s)p(u\s)p(x 1 ,x 2 \u,s)p(y\x 1 ,x 2 ,s)}. (22) 

Proof: Since the capacity with CSIR cannot be smaller than without CSIR, the first two 
bounds follows from the converse proof of Theorem [Q The third bound in (I2TI) is instead obtained 
by providing message and state information to the relays and thus the proof can be derived as 
in [0Q with the identification of auxiliary random variable as Ui = (W, S™ +1 , Y % ~ 1 ). ■ 

B. Achievable Scheme 1: GP Coding With Quantized States At The Relays 

In the absence of CSIR, the source can provide information about the state to the relays so 
as to allow the latter to perform GP coding. Following this idea and an appropriate combination 
of message splitting, superposition coding and successive refinement coding ll28l . similar to the 
discussion in the previous section, we can devise a scheme detailed below, which is referred to 
as GP coding with quantized states (GP-QS) at the relays. The GP-QS leads to an achievable 
rate given as follows. 
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Proposition 2: For the DM SD-DBDC model with non-causal CSIT and no CSIR, a lower 
bound on the capacity is given by 



/ 



R 



GP-QS 



max mm 



C\ + C 2 — I (Si, S 2 ; S), 



\ 



d - I(S l ; S) + I(U 2 ; Y \U, ) - I(U 2 ; S 1} S 2 \U, 
y I(U X , U 2 ; Y) - I(U i; S x ) - I(U 2 ; S ll S 2 \U 1 ) 
with the maximum taken over the distributions in the set 

p(s,s 1 ,s 2 ,u x ,u 2 ,xi,x 2 ,y) : 
p(s)p(si,s 2 \s)p{ui \si)p(u 2 \UI,S!,S 2 ) 
p(xi \u 1 ,sx)p(x 2 \xx,ux,u 2 ,s 1 ,s 2 )p(y\xi,x 2 ,s) 



(23) 



Pi 



(24) 



subject to 



andI(S u S 2 ;S)<d + C 2 . 



(25) 
(26) 



Sketch of Proof: The proof follows from rather standard arguments, and thus it is only 
sketched here. Let e 2 > e l5 and define functions S(ei) and S(e 2 ) such that S(ei) — > as ei — V 
and S(e 2 ) — > as e 2 — > 0. As done in the "separate" strategy discussed in the previous section, 
the source encoder splits message w 6 [1 : 2 nR ] into a common message W\ E [1 : 2 nRl ], to 
be delivered to both relays, and a private message w 2 G [1 : 2 nR2 ], to be delivered to relay 
2 (so that w = (wi,w 2 )). Moreover, a successive refinement code (Si,S 2 ) is used to describe 
the state sequence, where the description Si, of rate R Sl , is delivered to both relays, and the 
description S 2 , of rate R S2 , which refines the first, is communicated only to relay 2. As discussed 
around conditions (fT3l and (fl4)) . the following conditions guarantee the existence of a successive 
refinement code with test channel p(s\, s 2 \s) 



R S1 >I(S X ;S), 
and R S2 > I(S 2 ; S \S\ 



(27) 
(28) 



Moreover, in order to guarantee the successful delivery of the messages and state descriptions, 
the following conditions are sufficient 



Ri + R Sl < d, 
and i?i + R Sl + R 2 + R S2 < d 



d 



(29) 
(30) 
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Given the messages and quantized state sequences, GP coding is performed by the relays. 
Specifically, an auxiliary codebook of 2 n ( iJl+/ ( c/i;5l ) +5 ( ei ^ i.i.d. codewords u" is generated, and 
then partitioned into 2 nRl bins indexed by Bi(wi), where w 1 E [1 : 2 nRl ]. Using superposi- 
tion coding, for each codeword u™(wx,lx), where l\ E [1 : 2 n ( 7 ( c/i;Sl ) +<5 ( ei ))] is the index of 
the codeword u™ in the bin Bi(w{), a second auxiliary codebook of 2 n ^ R2+I ^ U2 ' Sl ' s ^ Ul ^ +s ^ e2 ^ 
i.i.d. codewords U2(w 2 ,l 2 \wx,h) is generated, and then partitioned into 2 nR2 bins indexed by 
B 2 (w 2 \w 1 ,l 1 ), where w 2 E [1 : 2 nR2 ] and l 2 e [1 : 2 n ^ I( - U2 ' Sl ' S2 \ u ^+^ e2 ^} is the index of 
the codeword u 2 l in the bin B 2 (w 2 \wx, lx). With these codebooks, GP coding of a message 
w = (wx,w 2 ) takes place as follows. Relay 1 and relay 2 encode W\ via the selection of 
a codeword Ui(wx,lx) that is jointly typical with the common quantized state sequence s™. 
Then, relay 2 encodes message w 2 by choosing a codeword uV, (w 2 , h \wx, h ) jointly typical with 
(m"(wi, li), s™, S2). Appropriate channel inputs x\\ and x 2 are then formed by relay 1 and relay 
2, respectively, based on the binning codeword(s) selected and the available quantized state(s). 

At the destination, upon observing the channel output y n , the decoder looks for a unique 
pair of (u™(wi/li),U2(w 2 /l 2 Wxjx)), that is jointly typical with y n , and assigns the message 
estimate as w = (wi, w 2 ). If none or more than one such pair is found, an error is declared. By 
the packing lemma [22, Chapter 3], it is shown that the probability of decoding error vanishes 
if 

R2 + I(U 2 ; Sx, S 2 \Ux ) < I(U 2 ; Y \U X ), (31) 

and Rx + I(Ux; Sx) + R 2 + I{U 2 ; S ± , S 2 \U t ) < I(Ux, U 2 ; Y). (32) 

Finally, combining the constraints above and using the Fourier-Motzkin procedure [|2~2l Ap- 
pendix D] to eliminate (R Sl ,R S2 ) and then (Rx,R 2 ) completes the proof of achievability. ■ 

C. Achievable Scheme 2: Quantized GP Coding 

In the GP-QS scheme, a separate description of state and message is conveyed to the relays. 
Based on the results with CSIR, one might envision that a scheme in which selection of the 
relays' codewords is done directly at the source based on both message and state information 
could be instead advantageous. One such scheme is described here. As further discussed below, 
however, without CSIR, this scheme is generally not optimal and might even be outperformed 
by the "separate" GP-QS strategy. 
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In the second scheme proposed here, inspired by [1241 . [|26l , GP coding is done by the source 
encoder, as if the source encoder had direct access to the relays. Given the finite-capacity link 
between source and relays, the source encoder then quantizes the resulting GP sequence using a 
successive refinement code, and conveys a common description to both relays and a private 
description to relay 2. Upon receiving the descriptions and hence having the reconstructed 
sequences, the relays simply forward them to the destination. Observing the channel output, 
the decoder looks for a GP codeword that is jointly typical with the received sequence, and 
obtains the message estimate as the index of the bin to which such codeword belongs. This 
scheme is referred to as the quantized GP coding (QGP). It leads to the following achievable 
rate. 

Proposition 3: For the DM SD-DBDC model with non-causal CSIT and no CSIR, a lower 
bound on the capacity is given by 

R QGP = max(I(U; Y) - I(U; S)) (33) 

V2 

with the maximum taken over the distributions in the set 

{p(s,u,v,xi,X2,y) ■ 
Fy ' ' ,yJ \ (34) 

p(s)p(u \s)p(v \u,s)p(x 1 ,x 2 \v )p(y \x x , x 2 , s) 

subject to 

I{X X -V)<C X , (35) 

andI{X 1 ,X 2 ;V)<C 1 + C 2 . (36) 

Remark 1: The proof of the proposition follows from the discussion above and standard 
arguments [HI, [|2~8l and hence details are omitted for brevity. In the achievable rate derived, 
we remark that as in [Q], U n denotes the auxiliary binning codewords, while V n denotes the 
(auxiliary) analog input sequence, produced by GP encoding at the source encoder. A common 
description of V n is carried via both X" and X% , a private one is carried via X% only. Inequalities 
d35l)— d36l) impose the rates at which the descriptions can be generated. The rate (l33l is the rate 
achievable by GP coding on the virtual channel that connects the source to the destination. □ 

Remark 2: While a general performance comparison between the GP-QS and QGP schemes 
does not seem to be easy to establish, it can be seen that when the link capacities are arbitrarily 
large, either the state sequence or the GP analog sequence can be perfectly conveyed to the relays, 
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•*w 



Fig. 3. A Gaussian SD-DBDC with an additive Gaussian state. 



and thus both the GP-QS and QGP schemes achieve the upper bound (I2T1) . and specifically the 
third bound in (|2T|) . thus giving the capacity. □ 



D. Gaussian SD-DBDC 

We now study a Gaussian SD-DBDC as depicted in Fig. [3l In particular, we assume that the 
destination output Y> at time instant i is related to the channel inputs X^, X 2ji at the relays and 
the channel state & as 



Yi — Xn + X 2i + Si 



(37) 



where Si ~ A/"(0, Pg) and Zj ~ J\f(0,N ), are i.i.d., mutually independent sequences. The 
channel inputs at the relays satisfy the following average power constraints 



1 n 

i=l 



(38) 



The encoding and decoding functions are defined as in Definition [Q except that the codewords 
are required to guarantee the input power constraints (l38l . 

1) Reference Results: For reference, we first consider the performance of a simple scheme 
that does not leverage the non-causal CSIT In particular, the source splits again the message w 
into two independent parts w = (u>i,u>2) and sends w\ at rate R\ to both the relays and w 2 at 
rate R 2 to the relay 2 via the digital links. In this way, the model at hand is converted into a 
Gaussian MAC channel with degraded message sets [|29l , ll30l . The decoder simply treats the 
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state as noise. The maximum message rates supported by the first hop are given by: i?x < C\ 
and R\ + R 2 < C\ + C 2 , while the capacity region for the MAC cut is obtained from [|30ll as 

:i-p 2 )P2 



Ri<C 



No + Ps 



Ri + R 2 <C 



P 1 + P 2 + 2pJT\P 2 



No + Ps 

for < p < 1, where we recall that C(x) is defined as C(x) = |log 2 (l 
overall achievable rate without using CSIT is given by 



(39) 

(40) 
x). Therefore, the 



/ 



R 



no SI 



max mm 

0<p<l 



Ci + C 2 , 



n I P 1 +P2+2pVT\Fh 

N +Ps 



\ 



c 



{1-P 2 )P2 

No+Ps 



Ci 



\ 



/ 



(41) 



which serves as a natural lower bound for the capacity of our example considered. 

A simple upper bound R^ pp can be instead obtained by providing the decoder with the 
interference sequence so that it can be cancelled. The capacity region of the corresponding 
state-independent system can be found from [30] and is given by (I4TT) with N in lieu of N + Ps. 

2) Achievable Rates: We now apply the GP-QS and QGP schemes discussed above to the 
given Gaussian model. 

Proposition 4: For the Gaussian SD-DBDC model, the following rate is achievable by the 
GP-QS scheme: 

/ r C 1 + C 2 -ilo g2 (g) J \ 

n ( (VPT+VpP^) 2 
'- ^ pPz+Dt+No 

where p = 1 — p and the set of A is defined as 

A = {(D U D 2 ) :P s >D 1 >D 2 >0,D 1 > P S 2~ 2C \D 2 > p s2 - 2 ^+ c ^} . 



-"-GP-QS 



max 

0<P<1, 
(D!,D 2 )eA 



mm 



-N 



\ 



c (_m_\ 

^ \D 2 +NoJ ) 



(42) 



(43) 

Proof: Note that the result of Proposition |2] can be extended to the continuous channel by 
standard techniques [|22l Chapter 3]. Thus, one can obtain the achievable rate in this proposition 
through evaluation of the general result therein by identifying appropriate inputs. Details of the 
proof are provided in Appendix |B] We remark that (X? l5 D 2 ) in (|43T) represent the distortions at 
which the state S is described to the two relays via the successive refinement code (Si, S 2 ) used 
in GP-QS. ■ 
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Next, we derive the achievable rate based on the QGP scheme. 

Proposition 5: For the Gaussian SD-DBDC model, the following rate is achievable by the 
QGP scheme: 

/ ,2 \ 



-^QGP — C 



(y/Pi(l ~ 2- 2C *) + yJP % (1 - 2-^+^)))' 



(44) 



R9-2C1 _|_ p n-2(Ci+C 2 ) ( 1 i O / Pi(l-2- 2p l) \ yw- 

/ lZ + 2 \ v i + / YP 2 (l-2-2(Ci+C 2 ))J + iV . 

Proof: The proof is obtained from Proposition |3] similar to the proof of Proposition 0] (see 
Appendix Q. ■ 

Remark 3: As the digital link capacity Ci becomes arbitrarily large, it is easy to see that 
both schemes GP-QS and QGP attain the upper bound R^ pp , leading to the capacity C = 
q ( P1+P2+2V P1P2 \ Note that the capacity is the same as if the interference at the destination 
was not present and if full cooperation was possible at the relays. The benefit of utilizing the 
non-causal CSIT is therefore evident from this example. We also emphasize that letting capacity 
C2 alone grow to infinity is not enough to obtain the upper bound above, as in this case only 
relay 2 can be fully informed by the central unit. □ 

Remark 4: The achievable rate -Rgp-qs °f scheme GP-QS is generally dependent on the 
interference power P s , while the achievable rate Rq GP of scheme QGP is not. This is because 
in the GP-QS scheme, the state sequence needs to be described to the relays on the finite-capacity 
links, and thus the stronger is the power Ps of the interfered state, the larger are the feasible 
distortions (P 1; D 2 ) in (1431) for reproducing the state sequence at the relays. As a result, in the 
extreme case in which the state power P s becomes arbitrarily large, the rate -Rqp-qs reduces to 
rate R^ Q SI (|4T|) obtained when the decoder simply treats the state as noise. On the other hand, 
in the QGP scheme, the source compresses directly the appropriate GP sequence, whose power 
does not depend on P s . Given the fact that the performance of QGP is not dependent on P s , 
it is expected that the QGP scheme outperforms the GP-QS scheme in case Ps is sufficiently 
large. □ 

3) Numerical Results: We now further investigate the performance of the proposed schemes 
via numerical results. We first fix the digital link capacities as C\ = 1.5 and C2 = 1. We 
also set Pi = P2 — P — 1, and vary N so that the signal-to-noise ratio (SNR), defined as 
SNR = 101og 10 (P/A r ), lies between [-10 : 30] dB. Fig. Hand Fig.[5]illustrate the corresponding 
achievable rates versus SNR, given Ps = 0.2 or 0.4, and Ps = 0.8 or 1.2, respectively. It can 
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Fig. 4. Achievable rates R vs. SNR for C\ = 1.5, C 2 = l,Pi = Pi = 1, Ps = 0.2 or 0.4. 



be seen that with a small state power P s , e.g., Ps = 0.2 as in Fig. rate -Rgp-qs °f scheme 
GP-QS improves upon rate P„ SI of the simple scheme without using CSIT, while rate Pq GP 
of scheme QGP is smaller than both. This is due to the fact that, when P s is relatively small, it 
is more effective to describe the state sequence to the relays, as done with GP-QS. In the case 
of moderate P s , e.g., P$ = 0.4 as in Fig. we observe that both the GP-QS and QGP schemes 
outperform the simple scheme. In the case of moderate-to-strong Ps, e.g., Ps = 0.8 or 1.2 as in 
Fig. [51 as explained in Remark scheme QGP is generally advantageous over scheme GP-QS. 
We now plot in Fig. [6] the achievable rates versus C\, for C 2 = 1, Pi = P 2 = 1, N = 0.1 
and P s = 1.2. It can be seen that, when C\ is large enough, both the GP-QS and QGP schemes 
attain the upper bound R^ pp , hence giving the capacity, as discussed in Remark Next, the 
achievable rates are plotted versus P s in Fig. [71 for fixed link capacities C\ = 1.5, C 2 = 1 and 
P\ = P 2 = 1, iV = 0.1. This figure further confirms the discussion in Remark @] by showing 
that both rates Pqp-qs an d Pno si decrease as Ps increases. 
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Fig. 5. Achievable rates R vs. SNR for Ci = 1.5, C 2 = 1, Pi 



l,Ps =0.8 or 1.2. 



V. Conclusions 

In this work, we have studied a state-dependent diamond channel, in which the broadcast 
channel between source and relays is defined by a noiseless degraded broadcast channel, and 
the multiple access channel between relays and destination is state-dependent. For the case with 
non-causal channel state information at the transmitter (CSIT) and at the receiver (CSIR), we 
have established the capacity and shown that a joint message and state transmission scheme 
via binning is optimal and superior to the scheme that performs separate message and state 
description transmission. For the case without CSIR, we have proposed an upper bound and two 
transmission schemes, and applied the results to a Gaussian model with an additive Gaussian 
state. For the Gaussian model, numerical results demonstrate the merit of the non-causal CSIT, 
and indicate that the best available transmission scheme generally depends on the power of 
the state. The capacity for the case without CSIR remains open in general and serves as an 
interesting problem for future work. 
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Appendix A 
Proof of Theorem Q] 

Throughout the achievability proofs in the paper we use the definition of a strong typical 
set ||22|. In particular, the set of strongly jointly e-typical sequences ||22|| according to a joint 
probability distribution p(xy) is denoted by T™(XY). When the distribution, with respect to 
which typical sequences are defined, is clear from the context, we will use T e n for short. 

Achievability: 

Codebook generation: Fix a joint distribution p(s)p(xi,x 2 \s)p(y\xi,x 2 , s) where p(s) and 
p(y \xi,x 2 , s) are defined by the channel. Let R = Ri + R 2 , R\ > Rx > and R 2 > R 2 > 0. 

n 

Randomly and independently generate 2 nRl i.i.d. x" sequences, each according to Yl p( x i,i) an d 

8=1 

then partition them into 2 nRl bins J3\{w\), with w\ 6 [l : 2 niil ]. Hence, there are 2 n ^ 1_iil ) x™ 
codewords in each bin, which are indexed by x"(tfi, Zi) with Ij G [1 : 2 n ( iJl_iJl )]. Moreover, for 

n 

any given x" (twi, /i), generate 2 ni?2 i.i.d. Xg sequences, each according to \~[ p(x 2:i Ixi^^i, h) ) 

«=i 
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Fig. 7. Achievable rates R vs. P s for C\ = 1.5, C 2 = 1,-Pi = Pa = 1, JV = 0.1. 



and then partition them into 2 nii2 bins B 2 (w 2 \w\,l\), with w 2 E [l : 2 ni?2 ] . Hence, there are 
2n(R2-R2) x n codewords in each bin, which are further indexed by x 2 (w 2 ,l 2 \wi,h) with l 2 E 
1 : 2 n( - R2 ~ R2 ^ . Reveal the whole codebook generated to all parties involved. 
Encoding: Let e^ > e 2 > e\, and define functions S(ek) such that 5(efe) — > as e^ — > 
for fc = 1, 2, 3. The source encoder splits message w E [l : 2 nR ] into two independent parts 
W\ E [l : 2 nRl ~\ and ai 2 6 [l : 2 ni?2 ]. Message wi is associated with each bin Bi(wx), while 
message w 2 is associated with each bin B 2 (w 2 \wi, h) for any fixed (w\, l\). Given the message 
pair (wi, w 2 ) and non-causal state information s n , the source encoder first looks for a codeword 
Xi(wi,h) E Bi(wi) such that (xi(wi,h),s n ) E T™(XiS); if there are more than one, choose 
the first one according to the lexicographic order; if there is none, set l x — 1. Given the 
Xi(wi,l\) found, the source encoder further looks for x 2 (w 2 ,l 2 |u>i,Zi) E B 2 (w 2 |u>i,/i) such 
that (x 2 (w 2 , l 2 \w\, h),Xi(wi, /i), s n ) E T™(X 2 XiS); if there are more than one, choose the first 
one according to the lexicographic order; if there is none, set l 2 = 1. Then the source conveys 
index vri\ = (wi,li) and index m 2 = (w 2 ,l 2 ) to the relays via the digital links. In particular, 



23 

index m x is intended for both relays and m 2 only for relay 2. Upon receiving the index and 
retrieving its corresponding components from the source, relay 1 transmits x^(w 1 , l{), while relay 
2 transmits £2(^2,^2 \wi,h) to the destination. 

Decoding: Given (s n ,y n ), the decoder looks for a unique tuple of (t&iJi, w 2 , 1 2) such that 
(xi(wi,h),x%(w2,l2 Wi,li),s n ,y n ) e T™(X 1 X 2 SY); if there is none or more than one such 
tuples, an error is reported. Then the final message estimate is assigned as w = (wi,W2). 

Analysis of probability of error. Without loss of generality, assume that w = {w\, w 2 ) = (1,1) 
is sent by the source and the indices conveyed to the relays are Mi = (1, L±) and M 2 = (1, L 2 ). 
The analysis of probability of error mainly follows from the covering lemma and the packing 
lemma |[22l Chapter 3]. Specifically, by the covering lemma, given any typical sequence s n , the 
source encoding error vanishes as n — > 00 if 

R 1 -R l >I(X 1 ;S) + 5(e l ), (45) 

and R 2 -R2> I{X 2 ; S \X X ) + 5(e 2 ). (46) 

Moreover, the indices Mi and M 2 can be perfectly conveyed to both relays and relay 2, 
respectively, as long as the digital link capacities satisfy 

Ri < C x , (47) 

and R 1 + R 2 <C 1 + C 2 . (48) 

By the packing lemma, the probability of decoding error event {(w±, h) ^ (1, Li), for all (w 2 , h)} 
vanishes as n — > 00 if 

R 1 + R 2 < I(X 1 ,X 2 ;Y,S)-5(e 3 ). (49) 

Similarly, the probability of decoding error event {(w\, h) = (1, L\), (w 2 , ^2) 7^ (1,-^2)} vanishes 
as n — > 00 if 

R 2 <I(X 2 ;Y,S\X 1 )-5(e 3 ). (50) 

Finally, combining the above conditions (1451)— (l50l) and using the Fourier-Motzkin procedure to 
eliminate (i?i, R 2 ) and then (R 1 ,R 2 ) completes the proof of achievability. 

Converse: Let Mi be the common index conveyed to both relays and M 2 be the private index 
conveyed to relay 2 only. First, considering the digital link capacity constraint, we have that 

nCi > H{M X ) (51) 
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>I(M 1] S n ) (52) 

= J2HMi;S i \S i - 1 ) (53) 

2=1 

II 

= J2l{M 1 ,S i - 1 ,X lti] S i ) (54) 

i=l 

II 

>^J(X 1:i ;^), (55) 



i=l 



where (l54l) holds because of the facts that Si is independent of S l x and X 1:i is a deterministic 
function of M t . By the same reasoning, we can show that 

n(C 1 + C 2 )>H(M 1 ,M 2 ) (56) 

>I(M 1} M 2 ;S n ) (57) 

>Y,HX u ,X 2X ,S l ). (58) 

i=l 

We can also write 

nR = H(W) (59) 

<I(W;Y n \S n )+ne n (60) 

n 

= ^/(^;F,|F^ 1 ,5") + ne n (61) 






^ [#(y, l^" 1 , 5") - ff (y« (r 1 " 1 , S n , W, Mi, M 2 , X lti , X 2 ,)] + ne n (62) 

2=1 

II 

J2[H(Y i \Y i - 1 ,S n )-H(Y i \S i ,X 1A ,X 2A )]+ne n (63) 



i=i 



<^2l(X lti ,X 2li - i Y i \3 i )+ne n (64) 



i=l 



with e n — > as n — > oo, where (1601 ) is due to Fano's inequality, i.e., H(W \Y n , S n ) < ne n ; (1621 
holds because (Mi, M 2 ) is a deterministic function of (W, S n ), X\^ is a deterministic function of 
Mi and X 2ji is a deterministic function of (Mi, M 2 ); (1631) follows from the memory less property 
of the channel; and (|64l) follows from the fact that conditioning reduces entropy. 
Next, we can prove a second bound on the rate as 

nR = H(W) (65) 
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H(W\S n ) (66) 

H(W,M x ,M 2 \S n ) (67) 

if (Mi, M 2 ) - J(Mi, M 2 ; S n ) + if (W |Mi, M 2 , S n ) (68) 

Jf(M 1 ,M 2 )-^/(M 1 ,M 2 ,X li4 ,X :M ,^- 1 ;^) + ii(iy|M 1 ,M 2 ,5 n ) (69) 



it 



< n{C x + C 2 )-J2 J (^M> x 2>i - Si) + H(W \M U M 2 , S n ) (70) 

i=l 
it 

= n(d + C 2 )~Y, I(Xi,i, X 2ti ; Si) + H(W \M X , M 2 , S n , Y n ) (71) 

i=l 

it 

< n(C x + C 2 )-J2 HX^i, X 2ii ; S,) + ne n (72) 

with e n — > as n — V oo, where (|66T ) is due to the independence between W and S* n ; (l67l) holds 
because (M X ,M 2 ) is a deterministic function of (W, S n ); (|69l follows from the facts that Si is 
independent of S 1 ^ 1 , X Xii is a deterministic function of M x and X 2) , is a deterministic function 
of (Mi,M 2 ); (|7Q|) follows because of the capacity constraints on the links between source and 
relays, and because of the chain rule and the non-negativity of mutual information; (TTTb holds 
due to the Markov chain W - (M u M 2 , S n ) - Y n so that I(W; Y n \M U M 2 , S n ) = 0; and CZS 
follows from Fano's inequality. 

Moreover, we have the third bound 

nR = H(W) (73) 

= H(W,M 1 \S n ) (74) 

= H(M 1 )-I(M l ;S n ) + H(W\M u S n ) (75) 

v, 
<nd-J2 HXi,i] S^ + H(W \M X , S n ) (76) 

i=l 

v, 

<nC t -J2 1(Xi,i\ Si) + I(W; Y n \M h S n ) + ne n (77) 

i=l 

n 

= nC 1 -J2 I (X hl ;S l ) 

i=l 

ri 

+ J2[H(Y l \Y i -\M 1 ,S n ,X 1 ,)-H(Y l \Y i - 1 ,M 1 ,S n ,W,M 2 ,X 1 „X 2 ,)]+ne n (78) 
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nC ^ -J^HXi^Si) +Y / [H(Y i \Y i - 1 ,M 1 ,S n ,X lti ) - H{Y i \S i ,X 1 , i ,X 2 , i )] +ne n 

i=l i=l 

(79) 



< nCl - J2 1(Xi,i; S z ) + J2 1 (X2, l ; Yi \X hi , Si ) + ne n (80) 

i=l i=l 

with e n — )• as n — > oo, where lines d74l to (TToT ) are obtained by similar reasonings for lines (1661 ) 
to (1701) in the previous bound; (1771 ) is due to Fano's inequality, i.e., H(W \Y n , S n , Mi) < ne n ; 
(1781) holds by the chain rule and also because M 2 is a deterministic function of (W,S n ), X^ 
is a deterministic function of M\ and X 2ii is a deterministic function of (M 1; M 2 ); (1791 follows 
from the memoryless property of the channel; and (|80l holds due to the fact that conditioning 
reduces entropy. 

Finally, let Q be a random variable uniformly distributed over the set [1 : n] . Define random 
variables S = S Q , X x = X ljQ , X 2 = X 2>Q and Y = Y Q . Then, bounds (O, d58]), ([64]), dTJ) and 
(l80l) can be written as 



C 1 >I(X 1 , Q ;S Q \Q) = I{X 1 ;S\Q), (81) 

C\ + C 2 > I(X ltQ ,X2, Q ; S Q \Q) = I(X 1 ,X 2] S\Q), (82) 

and 

R - e n < I(X ltQ ,X 2<Q ;Y Q \S Q ,Q) = I(X 1 ,X 2] Y \S,Q), (83) 

i? - e n < (d + C 2 ) - I(X ltQ ,X 2)Q ; S Q \Q) = (d + C 2 ) - 7(X 1? X 2 ; 5 |Q), (84) 
it! - e n < Q - J(Xi )0 ; Sq |Q) + I(X 2 , Q ; F Q |S Q , X liQ , Q) 

= d-I(X 1 ;S'|Q) + I(X 2 ;Y|£,X 1 ,Q), (85) 

where the distribution on (Q, S, X±, X 2 , Y) from a given code is of the form 

p(q, s, xx,x 2 , y) = p{q)p{s)p(xi, x 2 \s,q)p(y \xi, x 2 ,s). (86) 

To eliminate the variable Q from bounds (I8TT ) to (l8~5l . we note that 

I(Xi',S\Q) 

= H(S\Q)-H(S\X 1 ,Q) (87) 

= H(S)-H(S\X 1 ,Q) (88) 
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>I(Xi;S), (89) 

where (|88l ) follows from the fact that the symbols Si with i E [1 : n] are i.i.d. and hence S = Sq 
is independent of Q. Similarly, we can prove that 

I(X 1 ,X 2 ;S\Q)>I(X 1 ,X 2 ;S). (90) 

Moreover, the inequalities 

I(X 1 ,X 2 ;Y\S,Q) <I(X 1 ,X 2 ;Y\S), (91) 

tmdI(X 2 ;Y\S,Xi,Q) < I{X 2 ;Y\S,X 2 ), (92) 

hold because of the Markov chain Q — (Xi,X 2 ,S) — Y. Given the facts above, the bounds 
corresponding to (T7T> — (fTOl) are recovered by noticing that the distribution of the random variables 
(S, Xi,X 2 ,Y) obtained by marginalizing (l8~6l) over Q is of the exact form given in V of ®. 
This concludes the converse proof and also the proof of Theorem [TJ 

Appendix B 
Proof of Propositions] 

Based on the GP-QS scheme described in Section IIV-BI and whose achievable rate is given 
by (T2~3l— (l26l). for state encoding, we consider the following cascade of backward channels: 
S = S 2 + Z 2 = (Si + Zi) + Z 2 , where S ± ~ AT(0,P 5 - Di), Z x ~ jV(0,£>i - D 2 ) and 
Z 2 ~ J\f(0,D 2 ), are independent, and Ps > Di > D 2 > 0. This construction implies the 
Markov chain: S^ — S2 — 5 1 . Hence, we have that 

7(5 1 ;5) = itog a ^ ) (93) 

and J(5 l5 5 2 ; 5) = J(5 2 ; 5) = ^ log 2 (^ . (94) 

And the constraints of (1251) and (|26|) become 

£>i > Ps^ 2Cl , D 2 > p s 2~ 2(Cl+C2) . (95) 

For message encoding, we let X\ ~ A/"(0, Pi), independent of (S, Si, S 2 ); X 2 = </ £ p-Xi + V 2 , 
where < p < 1, and V 2 ~ A/"(0, pP 2 ) is also independent of (5 1 , Si, S 2 ). The auxiliary random 
variables Ui and C/ 2 are defined as 

U 1 =X 1 + a 1 S 1 , (96) 
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U 2 = V 2 + a2 \S 2 -a 1 \l + J^\ SA , (97) 

for some a x ,a 2 > to be specified later. Note that, with these choices, the channel output Y 
becomes 

Y = X 1 + X 2 + S + Z (98) 



l + \/jT }X 1 + V 2 + S + Z (99) 



Pi 

Therefore, with the choice of U\ given above, we have that 



' X 1 + V 2 + S 1 + Z 1 + Z 2 + Z. (100) 



, { u^)-m^)<c{^£^-), doi) 

where the equality is achieved by setting 

k + y/W)* 

a[ = V 2 V ' (102) 

(l + y^) P 1 + pP 2 + D 1 + N 

in (l96l) . which is such that a\{Y — Si) is the minimum Mean-Square-Error (MSE) estimate of 
X\ given Y — Si, similar to Costa's DPC [fT5l . Next, to decode the private message carried over 
U 2 , the decoder subtracts f 1 + y^jr ) U\ from Y obtaining the received signal 



J^jr I 



Now, with the choice of U 2 in (1971) , we have that 

I{U 2 -Y\U x )-I{Si,S^U 2 \U x ) (104) 



= I(U 2 ; Y')-ll U 2 ; S 2 - a* ( 1 + ^ ^ j Si j (105) 

< C f^rl , (106) 



^D 2 + iV 
where the equality is achieved by setting 

m 

pP 2 + D 2 + N 
This concludes the proof. 



«2 = ^ ,V , Ar - (107) 
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Appendix C 
Proof of Proposition \5\ 

Based on the QGP scheme described in Section IIV-CI and whose achievable rate is given by 
(1331)— (l36l). we let the auxiliary random variable V ~ A/"(0, P v ) for some P v > 0, independent 
of S. Consider the following cascade of forwarding channels: X 2 = V + Z 2 , and X x = c\\X 2 + 
Z u where X x ~ A/"(0,Pi) and X 2 ~ J\f(0,P 2 ); Zi ~ Af{0,crf), Z 2 ~ JV(0,crf), which are 
independent of each other and also of V; Parameters a x , of and o| are to be specified. Following 
this construction, note that X x — X 2 — V forms a Markov chain. Therefore, the constraint of 
(1361) becomes /(Xi, X 2 ; V) = I(X 2 ; V) = \ log 2 (p.) < C x + C 2 . Thus, one can choose u\ = 
p 2 2-2(Ci+c 2 ) Then5 p v _ p 2 ^ _ 2 -2(Ci+c 2 )j due t0 the power constraint on X 2 . Moreover, 

noting that a^P 2 + a^ = Pi and | log 2 ( -^rrp, ) < C\ due to constraint (l35l ). we thus choose 

ol = 1 1 _ 2 ^2(c 1 +c 2 ) and ocx = J - ./l_ 2 -l {Cl +c2)) ' The auxiliar y random variable C/ is defined 
as [/ = V + /?*£, where /?* is chosen to be the weight of the minimum MSE estimate of 
V given Y — S = X\ + X 2 + Z, similar to Costa's DPC [15]. In this way, the message rate 
RG Qp = j(f/ ; y) - I(U; S) = I(V; X x + X 2 + Z) which equals ©. This completes the proof. 
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